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Abstract 

For an improper complex signal x, its complementary covariance Exx^ is not zero and thus it carries 
useful statistical information about x. Widely linear processing exploits Hermitian and complementary 
covariance to improve performance. In this paper we extend the existing theory of widely linear complex 
Kalman filters (WLCKF) and unscented WLCKFs [H. We propose a WLCKF which can deal with 
more general dynamical models of complex-valued states and measurements than the WLCKFs in H]. 
The proposed WLCKF has an equivalency with the corresponding dual channel real KF. Our analytical 
and numerical results show the performance improvement of a WLCKF over a complex Kalman filter 
(CKF) that does not exploit complementary covariance. We also develop an unscented WLCKF which 
uses modified complex sigma points. The modified complex sigma points preserve complete first and 
second moments of complex signals, while the sigma points in U only carry the mean and Hermitian 
covariance, but not complementary covariance of complex signals. 

Index Terms 

complementary covariance, Kalman filter, sigma points, widely linear transformation, unscented 
Kalman filtering. 

L Introduction 

Complex signals are ubiquitous in science and engineering, arising as they do as complex representa- 
tions of two real channels or of two-dimensional fields. Consider a zero mean complex random vector 
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X. The usual covariance matrix defined as E-k-k^ describes its Hermitian second order covariance. But 
wlien X and its complex conjugate x* are correlated, the complementary covariance matrix E'xx^ does 
not vanish, so it carries useful second order information about the complex random vector x. We call a 
complex random vector proper as long as its complementary covariance matrix vanishes and improper 
otherwise. Proper complex vectors have a statistical description similar to real vectors, but improper 
random vectors do not. A comprehensive second order analysis of improper random vectors and processes 
is considered in |[2l-ll6l. 

For any improper random vector x, for which x is correlated with its complex conjugate x* , intuition 
suggests that a good estimator of x should depend on x*. This requires a methodology of widely linear 
processing instead of strictly linear processing ||2]- For random complex signals, the merit of widely linear 
processing has been exploited in various papers on estimation 111,1161, filtering HI, 131, lH, detection Q, 
l8l . and equalization It turns out that widely linear processing brings improvement in performance 
over strictly linear processing |@], lITOl when there is complementary covariance to be exploited. 

In the past few decades the reasoning of the Kalman filter fTTI has been modified to apply to nonlinear 
problems, producing Extended Kalman filters lT2l and Unscented Kalman filters fT3l . The motivation of 
this paper is to make use of widely linear processing to develop novel complex Kalman filters and their 
nonlinear versions for improper complex states. We show that for improper complex states, complementary 
covariance matrices may be used to create widely linear complex KFs (denoted WLCKFs) and Unscented 
WLCKFs. The key contributions of this paper are as follows: 

• From a linear real dual channel dynamical model we derive an equivalent widely linear complex 
single channel dynamical model, where the updates of random states and measurements depend on 
both states and noises and their conjugates. For the complex model we derive a WLCKF which is 
equivalent to the conventional KF for the dual channel model. The WLCKFs proposed in 01 consider 
special dual channel problems and their corresponding complex dynamical models. In these complex 
models the updates of complex random states and measurements do not depend on the conjugates 
of states and noises. 

• We compare the performance between the WLCKFs and conventional KFs. Our analytical and 
numerical results show that for some special distributions of states and noises, the MSE of the WL- 
CKF is significantly smaller than the MSE of a CKF that does not exploit non-zero complementary 
covariance. 

• For dynamical models with complex nonlinear state and measurement equations, we develop an 
Unscented WLCKF for which a systematic paradigm to construct modified complex sigma points is 
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Studied. The property of modified sigma points is that they preserve the complete first and second 
order statistical information of complex random vectors. The WLCKF of [H uses sigma points that 
only preserve the mean and Hermitian covariance, but not the complementary covariance of states. 



II. Brief Review of Complex Random Vectors 

Let Q be the sample space of a random experiment that generates two channels of real signals u, v G M" 
defined on fl. From this we construct the real composite random vector z G M^" as = [u^, v^], and 
the complex random vector x e C", obtained by composing u and v into its real and imaginary parts: 



X = u + jv. 

The complex augmented random vector x corresponding to x is defined as 



x^ = [x^ x^l 



(1) 



(2) 



From here the complex augmented random vector will always be underlined. It's easy to check that the 
real composite vector z and the complex augmented vector x are related as 



X = T„z. 



(3) 



The real-to-complex transformation T„ is 

I jl 
I jl 

which is unitary within a factor of 2: 

T„Tf = T^T„ = 2I. 

In fact, it is equation ^ that governs the equivalence between dual channel filtering for z and complex 
filtering for x. 

The augmented mean vector of the complex random vector x is 



(4) 



(5) 



and the augmented covariance matrix of x is 

R,., = ii;(x-/xJ(x-u )^ 



R * R * 



TR,,T^, 



(6) 



(V) 



where the matrix R^:^ = £'(x — /x^)(x — /x^)^ is the conventional Hermitian covariance matrix, and the 
matrix R^:^ = ii^(x — /i^)(x — /x^)^ is the complementary covariance matrix 
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Definition 1: If the complementary covariance matrix Yl^x is zero, then x is called proper; otherwise 
X is improper. 

The random vector x = u + jv is proper if and only if Huu = R-i>i> and Huv = —^uv where u and 
V are the real and imaginary parts of x respectively. 

III. Dual Channel Real and Widely-Linear Complex Kalman Filter 

Start with two real channels worth of random states ut,^rt G K". Denote zf = [uf ^^f] as the 
corresponding real composite state. Suppose the composite state and measurement equations are 



Ezt_i + Fut-i 
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and 



Gzt + TJt 
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, t = 0,l,... 



(9) 



where uj = [fxf crj'] and riJ = [pj 4>J] are the composite real driving and measurement noises, and 



xjx^ = [1^ kJ] is the composite measurement. This dynamical model allows the states and measurements 
on the respective real channels to be arbitrarily coupled. For the real composite vectors zj, Ut, t]^, and tp^, 
establish their complex augmented representations as Xf = [xf x^]^ = Tzj, = [w^ w^]-^ = Ta^j, 
= [yf y^]^ = T■^p^, and = [nf n^]-*" = Tr]^. Then the resulting augmented complex state and 



measurement equations are 



= Ax^-i +Bwi-i, t = l,2 
y, = C2^t+Rt, t = 0,l,... 



5 



where the augmented matrices A, B, and C are 



1. 
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Suppose the initial state has mean E'xg = 0, and augmented covariance 
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(11) 

(12) 
(13) 



Using the representation advocated in llT4l . the augmented second-order characterization of (xo)ilt)Ilt) 
is given by 
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, m, n > 0. 



(14) 
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We further assume that for n > m, Ew„:x.S, = and £'n„x^^ = 0, and for n > m, Ew„y^ = and 

iL IlL IL lit ' ''ii 

En^-y^ = 0. This is the same setup as that of the usual Kalman filter, but with covariances augmented 
to account for non-zero complementary covariance. 

Suppose the LMMSE estimator of Xf_i from measurements = (y^, , 

Then the LMMSE prediction of Xj from is 

E.t\t-i = Ax(_i|t_i, 

and the prediction of from Y^^ is 

y,i,_i = Cx,|,_i. 



(15) 



(16) 



Given the error covariance matrix P_f-_m_i for e^_l^^_'^ = ^t-i\t~i — 2£t-i> the error covariance matrix 



Pt|t-i for ei|i„i = Xi|i„i - Xi is 



(17) 



_P*|t_l 'Pt\t-1_ 

where Pt\t~i ^t\t-i the Hermitian and complementary error covariance respectively. The error 
covariance matrix S^i^^i for the innovation n^i^^i = y^<^_i — 



IS 



'^t\t-i - CPt|f_iC^ + R 



St|t-i ^t\t-i 



(18) 



where ^t\t~i ^"d St|t_i are the Hermitian and complementary innovation covariance respectively. The 
normal equation for the Kalman gain is 

KiS,|,_i=P,|,_iC^. (19) 
Thus the augmented Kalman gain may be written as 



(20) 



When complementary covariances 'Pt\t-i and S^i^.i vanish, and when C2 is zero, we have 

t\t 



diag{Kt,K*), where = Pi|<_iCf S~^_^ is the usual KF. Finally, the WLCKF is 



and the error covariance matrix for e^j^ = x^j^ — x^ is 



Ptii = a-K£)p,|,_i 



tt ^tt 



-p* p* 

t\t t\t 



(21) 



(22) 
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Finally, the WLCKF is implemented by initializing Xq|q = and Pqiq = Ho, and recursively running 
the procedure (fT5Tl- (|22T i. This WLCKF can be implemented in complex arithmetic, or it can be inverted 
for the real KF of the dual channel real model dH)-® by using real to complex connections ^ and ([TJ. 

Remark 1: In the state and measurement equations ([T0l)- ([T2] ). the new state depends on X(_i, x^_^, 
v^(_i, and w*„^. And measurement depends on x^, x^, n^, and nj'. For the state and measurement 
equations of the WLCKF proposed in UJ, the new state xj depends only on Xf_i and Wf_i, and 
measurement depends only on x^ and n^. Thus the WLCKF in [H can be obtained as a special case 
of the WLCKF considered here by letting matrices A2, B2, and C2 in ([T0l)-(fT2l) be zero, or equivalently 
assuming En = E22, E12 = -E21, Fn = F22, F12 = -F21, Gn = G22, and G12 = -G21 in the real 
channel equations dUl-®. 

Remark 2: An insightful interpretation of the widely linear KF is that the augmented Kalman gain 
determines the WLMMSE estimator of the prediction error et\t~i = — ^t\t~i from the innovation 
nf|t_i = yt — yt\t-i- Thus the widely linear complex KF reduces to the linear complex KF if and only 
if ej|t„i - Kjnt|j„i is orthogonal to n*|^_p where Kt = Pf_i|t_iC^S^yj^_^. That is, 

= Pi|t-iC"^ — Pt„m^iC^s^^^_^Sm_i 
= 0. 

One special case is that the error covariance of the predictor e^i^.i is proper, Pt|t_i = 0, and the 
innovation rif^t-i proper, Sj|^_^ = 0. This is true when ej_x|t-i' w^, and are all proper. Another 
special case is that nt\t~i is maximally improper, i.e., iit\t~i = (^'^t\t~i ^^^^ probability 1 for constant 
scalar \a\ = 1. This is irrespective of whether e^^t-i is improper. For instance, assume y^ = Re(xf) + nt 
with nt real. We know that yt is a noisy widely linear transformation of xt and yt|t-i = R£(^t|t-i)' 
meaning the innovation nt|t_i is real and hence maximally improper. One can also readily see that no 
widely linear processing is needed because yt is only dependent on the real part of xt. 

IV. Performance Comparison between WLCKF and CKF 
Let's suppose the state and measurement equations for a complex state xt are 

Xt = at^ixt^i + bt^iwt-i, t = l,2,..., (23) 

yt = ctxt + nt, t = 0, 1, (24) 
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1 (25) 
, t = 1,2 



where at,bt,ct E C for all t > 0. The augmented matrices are = diag{at,a^), = diag(bt,b^), 
and = diag{ct, c|). Then the recursion for the 2 by 2 augmented covariance matrix P^i^ is 

Thus the performance of the WLCKF is determined by the impropriety of the initial state xq through 
Hg, the driving noise wt through Q, and the measurement noise nt through R. In the following we show 
that for some special distributions of state and noises, the WLCKF produces smaller MSB than the CKF. 
Case 1: xq is improper, wt and nt are proper. 

Suppose Pqio = ( p"'" ) , Q = A''il, and R = Assume Pgio has eigenvalues {Aj*, A2}. Given 



-OlO — \ p* p > — ^ni, 111" XV, — iV2i. rtssuinc ^qIO 

the eigenvalues {A^~^, Ag"^} of matrix P(_i|t_i, the eigenvalues of P^j^ are 

A* = 5t(A*-^), i = l,2, 

where the function gt is given by 

|Q|^(|at|^A + lOt^iVi) + iV2 
Thus the eigenvalues {A^, A2} may be conveniently expressed as the function recursion 

A* = 5t°5<-io---°5i(A°) = gt(A0) 

Observe that gt is an increasing concave function w.rt A for each t. Thus we conclude that qt is concave 
for each t. Next we want to compute the widely linear minimum mean squared error (WLMMSE) at the 
t-th iteration for the WLCKF. This may be written 

er = E II et\t f= iHRtlt) = liQtiX'i) + qtiX^)). (26) 
Note the initial scalar Hermitian covariance is Pq^q. Then the t-th LMMSE for the CKF is 

To achieve the maximum performance improvement of the WLCKF over the CKF for the special case 
discussed here, we shall minimize with fixed Po|o ^^'^ variable Po|o- It '■^^^ be seen that at each t, 
is a Schur-concave function w.r.t all A?. Since A^ + A2 < 2Po|o B' the minimum is achieved when 

[A? A"] = [2Po|o 0]. (27) 

Substituting (l27l ) into (l26l ). we have the minimum 

miner = ^(9i(2Po|o)+9i(0)) 
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The ratio of min^^^ to is: 

_ ming,^ _ g,(2Po|o) + g,(0) 
2gt(Po|o) 

It's obvious that ^ < 9t < 1. This is because qt is concave, and (7j(2Po|o) + 9i(0) ^ 2gt(Po|o) for ^^y 
Po|o- Also (7t(2-Po|o) ^ 9t(^o|o) for ^^ly Po|o and qt{0) > 0. Actually the condition for achieving the 
lower bound is A'^i ^ ^ 1- This coincides with the MMSE analysis in Q. 
Case 2: xq is proper, wt and nt are improper. 

In this case we have Po|o = -Po|oL Q = p* ^ > and R = ^ p* ^ > where is the complex 
correlation coefficient between wt and , and /?„ is the complex correlation coefficient between nt and 
n|. These determine the level of impropriety. We can show that < |p«;|,|/On| < 1. Fig 1 plots the 
performance improvement of the WLCKF over the CKF at different level of impropriety of wt and nt- 
We choose Po|o = 1, a = 6 = c = l. The performance improvement is defined by ratio between the 
convergent MSB of the CKF over that of the WLCKF. As Fig. 1 illustrates, the performance improvement 
is monotone in |/3^| for fixed and monotone in for fixed \pui\- 

V. Unscented Widely Linear Kalman Filter 

In this section we consider the following nonlinear model for dual real channel state and measurement 
evolution: 

zt = [ut ^^tf = ft-i{zt-i,uJt-i) = fi-i([ut_i Yt.if, [fit <Ttf),t = 1,2,..., (28) 
V't = [^t i^tf = ht{zt,rjt) = ht{[ut ^rtf, [p, cl>tf),t = 0, 1,..., (29) 
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where ft-i and are time varying nonlinear transformations, and the notation for states and noises is 
identical with the model equations ([D-®. Then the induced complex model equations are 

xt = ft _ 1 (xj _ 1 , wj _ 1 ) , (30) 

yt = ht(xj,nt). (31) 

It can be seen that for all t, ft_i and ht are not widely linear transformations. Thus the WLCKF developed 
in section III cannot be directly utilized. For such a model, the extended WLCKF is proposed in ifTI . 
|[T5l to exploit the impropriety of complex states and noises. However, the major defect of the EWLCKF 
is that the posterior means and covariances are accurate only to the first order in a Taylor expansion. 
A conventional Unscented KF uses the unscented transformation (UT) to generate a fixed set of sigma 
points to represent the distribution of a random variable lITSl . After propagating sigma points through 
nonlinearities, the estimated posterior mean and covariance are precise at least to second order in a 
Taylor expansion. Motivated by the power of UKF, in this section we present a novel paradigm for 
constructing UWLCKFs. Our UWLCKFs use modified sigma points which preserve the Hermitian and 
complementary covariances of states and noises, while the UWLCKFs proposed in IH use sigma points 
which only preserve the Hermitian covariances of states and noises. 

Compose complex random states and noises into a complex vector = [x-^ n^]. Suppose the 
augmented mean and covariance of s are 

In m the authors proposed complex sigma points of s which are constructed from moments /x^ and Hgs- 
Thus these sigma points only carry /x^ and Hgs, but not Hgs- In fact, there may be multiple ways to 
generate sigma points for the augmented random vector s which carry both fi^ and R^^. But a hidden 
restriction imposed here is that these sigma points should be augmented vectors. Otherwise they cannot 
be propagated through the UWLCKF. One approach is to start with sigma points of the corresponding 
composite real random vector = [u^ /z^ cr^ cfp^]. The first and second moments of C, are 

/^C = ^T^V,, RCC = ^T^R,,T. (33) 

Using a Cholesky decomposition the composite covariance matrix R^^ may be factored as 

R(^(^ = BB^ 



R-.Q.*? R-.Q 



R* R* 



(32) 
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Denote the vector as the A;-th column of matrix B for /c = 1, 2, 2N . Then the sigma points {Z^} 
of (T are |[T3l 



Zk 



/x^, /c = 0, 

+ ^/2N + Abfc, A; = 1, 2Af, 



(34) 



Sfc = /^c - V2iV + Abfc_2jv, A: = 27V + 1, ...,47V, 

corresponding to the mean weights {VFm(fc)}|^Q and covariances weights {VFc(fc)}|^Q defined in |[T3l . 
Define a set of augmented vectors {-Yj^.} as 



TZu 



(35) 



M^, A; = 0, 

+ VWTATbfc, A; = 1,...,27V, 

V2NT\Thk-2N, A; = 2iV + l,...,4iV. 
We can show that all the compose the sigma points of the augmented vector s, since = M ^nd 



Tbi Tb2 



Tb 



2N 



Tbi Tb2 



Tb 



27V 



H 



Therefore we have obtained the sigma points {X_k} of s w.rt weights {Wm{k),Wc{k)} from widely 
linear transformation of the real composite sigma points {Z^} of C, w.r.t weights {Wm{k),Wc{k)}. 
Note that each sigma point is an augmented vector. Thus it follows that the complex set {Xk}, 
generated by extracting the top halves of is sufficient to capture both first and second order 

statistical information of the augmented random vector s. We call {X^} the modified sigma points of 
s. The impact of these modified sigma points is that {X^} preserves not only mean /x^ and Hermitian 
covariance R^^, but also complementary co variance R^^. 

Example 1 (Phase Demodulation Problem): Consider a scalar real random phase Ot that is updated as 



aOt-i + bw 



t-i, 



t = 1,2,..., 



(36) 



where wt is a real driving noise. So, the phase is real, and it evolves or jitters according to a first-order 
Markov sequence. The measurement in a quadrature demodulator is a noisy complex signal modulated 

by et. 

yt = e''^^+nt, t = 0,l,... (37) 

where each nt is assumed to be a zero mean, scalar complex Gaussian random variable |0] with Hermitian 
variance R and complementary variance R. The complex correlation coefficient between nt and n| is 
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(a) (b) (c) 

Fig. 2. Compari.son between UWLCKF and UKF. (a) Phase estimated by UWLCKF at each iteration, SNR = 30dB, |p| = 0.5. 
(b) Normalized estimation error ^ of UWLCKF and UKF vs SNRs, \p\ — 0.7. (c) Performance improvement r of UWLCKF 
over UKF vs impropriety of rit. 



p = ^ which describes the impropriety of nt. The signal-to-noise ratio at the receiver is SNR = R~ . 
In simulation we set a = 0.98, b = 0.05. Each wt is a standard mean zero and variance one Gaussian 
real random variable, independent of all others. 

Fig. 2(a) draws the outputs of the UWLCKF over time at SNR = 30dB and \p\ = 0.5. The UWLCKF 
is constructed according to Algorithm 3. The widely linear Kalman gain for the UWLCKF is a 2 by 2 
matrix and the estimate 9t\t is always real. It can be observed that for most iterations, the estimate 6t\t 
is close to the phase 6t. Also the true 9t is almost confined by the envelope Ot\t^ 

Fig. 2(b) compares the performances of the UWLCKF that accounts for the impropriety of the noise, and 
the UKF that assumes the noise to be proper. Unlike the UWLCKF above, the UKF estimates 6t from a real 
2 by 1 measurement vector consisting of the real and imaginary part of yt collected from dual channels. At 
each iteration the UKF produces sigma points from the real mean vector [9f\i jiyj pu pv]'^ = [dt\t 0]^ 
and covariance matrix M = diag{P^\^, 1, Ru,Rv), and it has a 1 by 2 Kalman gain vector. The complex 
correlation coefficient is \p\ = 0.7. Define the normalized squared error as ^ = ||e||2/||0||2> where and 
e are vectors consisting of phases and estimation errors in 500 iterations respectively. In the plot each 
^ is computed by averaging 1000 Monte-Carlo simulations. It can be seen that in the low-medium SNR 
regime, UWLCKF requires about 2dB less SNR than the UKF. 

Fig. 2(c) shows the performance improvement of the UWLCKF over the UKF vs the noise impropriety 
\p\ at different SNRs. We use the factor r = ^ukf/Cuwlckf to evaluate the advantage of UWLCKF. The 
normalized squared error ^ukf and ^uwlckf are defined as above. Each r is computed by averaging 1000 
Monte-Carlo simulations. For \p\ > 0.8, the gain r > 2. 
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VI. Conclusion 

In this paper we have designed widely linear and unscented WL complex Kalman filters for complex 
noisy dynamical systems with improper states and noises. We show that WLCKFs may significantly 
improve on the performance of a CKF that ignores corresponding covariance. A simulation for real phase 
demodulation shows how an UWLCKF produces real estimates from complex baseband measurements 
and shows the improvement of its performance over an unscented complex KF that assumes proper states 
and noises. 
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